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CHOLINE MONOOXYGENASE GENE 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to plant choline monooxygenase which is involved in the 
occurrence of plant injury attributable to dry or saline soil, which is believed to contribute to the 
improvement of plant tolerance to dry or saline soil, and which is induced by dry or saline soil, 
as well as a gene encoding the choline monooxygenase. 

2. Description of the Related Art 

At present, desertification or salt accumulation in cultivated land is progressing in a large 
number of areas on the earth. These environmental changes are considered as serious issues in 
connection with current environmental problems and food problems in the 21 st century. As 
means to solve these problems, the breeding of environmental stress-resistant plants is attracting 
attention together with engineering-based solutions such as irrigation. 

Specific examples of damage caused by salt accumulation may be enumerated as follows: 
(1) because of accumulated salts in soil, moisture potential in soil decreases, which makes it 
impossible for plants to absorb moisture; (2) because of salts which have been absorbed (or 
which have invaded) into plant bodies, plant metabolism is disturbed; and (3) because of 
accumulated salts, absorption of other ions necessary for plant growth is inhibited (Funiihiko 
Sato, Plant Cell Engineering, an extra issue, "Environmental Problems and Biotechnology", pp. 
33-39, 1 994). In particular, inhibition of moisture absorption caused by dehydration, salt 
injury, etc. eventually decreases photosynthesis activity to thereby inhibit the growth of plants 
greatly. 

The existence of adaptation mechanisms against stresses such as dehydration or salts has 
become evident in microorganisms and plants. Among all, compatible solutes (i.e. low 
molecular weight organic compounds or osmoregulating substances) have been investigated 
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vigorously. Compatible solutes are those substances which are characterized by having low 
molecular weights, being rich in water-solubility and difficult to metabolize, and not affecting 
metabolism. As specific examples of compatible solutes, amphoteric compounds such as 
glycine betaine, proline, and polyols such as pinitol, sorbitol, mannitol are known. In 
particular, glycine betaine (hereinafter, referred to as "betaine") is Utilized widely not only in 
higher plants such as chenopodiaceous plants, gramineous plants and solanaceous plants, but 
also in microorganisms. It is reported that this compatible solute is functioning in protecting 
proteins from high temperature stress (Paleg, L.G. et al., Aust. J. Plant Physiol. 8:107-114, 1981; 
Allakhverdiev S.L, J. Photochem. PhotoioL 34:149157, 1996), in mamtaining osmotic pressure 
balance against the environment (Robinson, S.P. and Jones, G.P., Aust. J. Plant Physiol. 13:659- 
668, 1986) and in protecting soluble enzymes from salt stress (Gabbay-Azaria et al., Arch, 
Biochem. Biophys. 264:333-339, 198S). 

In spinach which is well studied among higher plants, betaine is synthesized in two steps 
through choline and betaine aldehyde. Specifically, oxidization in the first step is catalyzed by 
a ferredoxin-dependent choline monooxygenase (Brouquisse, R, et al, Plant Physiol. 90: 322- 
329, 1989), and oxidization in the second step is catalyzed by aNAD-dependent betaine 
aldehyde dehydrogenase (Weretilnylc, EA. et al., Planta 178: 342-352, 1989). It is confirmed 
that when such a plant is exposed to salt Stress, the activity of each of the above enzymes rises 
and the amount of betaine is increased (Hanson, A.D. et al., Proc. Natl. Acad. Sci, U.S.A. 82: 
3678-3682, 1985). 

A choline oxidase obtained from a Gram-negative soil bacterium Arihrobacter 
globiformis is able to oxidize choline to betaine in one step oxidization (Bcuta, S. et aL> J. 
Biochem. 82: 1741-1749, 1977). 

Several attempts have been made to accumulate betaine in plants and confer salt tolerance 
on them by incorporating in plant bodies two enzyme genes from Escherichia coli and a higher 
plant or a choline oxidase gene and allowing the gene constant expression. Accumulation of 
betaine in plant bodies have been reported when Arthrobacter globiformis codA gene 
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(W096/29857), K colt betA gene (Japanese Unexamined Patent Publication No, 10-191983) 
and spinach CMO gene (Nuccio, M.L. et al., The Plant J., 16: 487-496, 1998) were 
incorporated. However, no attempts have succeeded in accumulating betaine at such levels as 
seen in salt tolerant plants. Thus, it is desired to establish a betaine synthesis system which can 
accumulate betaine in plants at high levels, 

OBJECTS AND SUMMARY OF THE INVENTION 

It is the object of the invention to provide a choline monooxygenase, a gene encoding the 
same, a vector comprising the gene, a transformant comprising the vector, a stress resistant 
plant, and a method for inducing betaine accumulation. 

As a result of intensive and extensive researches toward the solution of the above 
problem, the present inventors have isolated a fulHength choline monooxygenase gene 
inducible by dry and saline soil from Chenopodium album L. which exhibits tolerance to dry 
and saline soil and can accumulate betaine at a high rate of about 60 ft mol/g fresh weight 
under salt stress, and found that this gene is capable of accumulating betaine in plant bodies. 
Also, the inventors have found for the first time that a transit peptide sequence of the choline 
monooxygenase gene induces protein accumulation under salt stress. Thus, the present 
invention has been achieved. 

The present invention relates to the following recombinant protein (a) or (b): 

(a) a protein comprising the amino acid sequence shown in SEQ ID NO: 2, 4 or 6; 

(b) a protein which comprises the amino acid sequence shown in SEQ ID NO: 2, 4 or 6 
having deletion, substitution or addition of one or several amino acids, and which has choline 
monooxygenase activity. 

The present invention further relates to a choline monooxygenase gene encoding the 
above-described protein. 

The present invention further relates to a gene comprising the following DNA (c) or (d): 

(c) a DNA comprising the nucleotide sequence shown in SEQ ID NO; 1, 3 or 5; 
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(d) a DNA which hybridizes to a DNA comprising the nucleotide sequence shown in 
SEQ ID NO: 1, 3 or 5 under stringent conditions and which encodes a protein having choline 
monooxygenase activity. 

The present invention further relates to a recombinant vector comprising the above- 
described gene. 

The present invention further relates to a transformant comprising the above-described 
recombinant vector. 

The present invention further relates to a method for producing a choline monooxygenase, 
comprising culturing the above-described transformant and recovering the choline 
monooxygenase from the resultant culture. 

The present invention further relates to the following peptide (e) or (f); 

(e) a peptide comprising the amino acid sequence shown in SEQ ID NO: 17; 

(f) a peptide which comprises the amino acid sequence shown in SEQ ED NO: 17 having 
deletion, substitution or addition of one or several amino acids and which has signal peptide 
activity; or a salt thereof. 

The present invention further relates to a gene encoding the above-described peptide. 
Specific examples of the gene include a gene comprising the following DNA (g) or (h): 

(g) a DNA comprising the nucleotide sequence shown in SEQ ID NO: 16; 

(h) a DNA which hybridizes to a DNA comprising the nucleotide sequence shown in 
SEQ ID NO: 16 under stringent conditions and which encodes a protein having signal peptide 
activity. 

The present invention further relates to a recombinant vector comprising a gene encoding 
the above-described peptide and a gene of interest. As the gene of interest, a gene which leads 
to production of a polypeptide or production of a plant metabolite (e.g. a substance that confers 
stress resistance), or Chenopodium album choline monooxygenase gene may be enumerated. 

The present invention further relates to a transformant comprising the recombinant vector 
comprising a gene encoding the above-described peptide and a gene of interest. Specific 
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examples of the transformant include a plant body, plant organ, plant tissue and cultured plant 
cell. 

The present invention further relates to a method for creating an environmental stress- 
resistant plant, comprising culturing or cultivating a transformed plant comprising the above- 
described recombinant vector under environmental stress (e.g. salt stress) conditions; or an 
environmental stress-resistant plant created by this method. 

The present invention further relates to a method for inducing accumulation of a 
polypeptide or a plant metabolite (e.g. a substance that confers environmental stress resistance), 
comprising culturing or cultivating the above-described transformant under environmental stress 
conditions. As a specific example of the substance that confers environmental stress 
resistance, betaine may be given. 

The present invention further relates to a method for producing betaine, comprising 
culturing or cultivating a transformant comprising the recombinant vector comprising a gene 
encoding the above-described peptide and a choline monooxygenase gene and then recovering 
betaine from the resultant culture or the cultivated product. 

This specification includes part or all of the contents as disclosed in the specification 
and/or drawings of Japanese Patent Application No. U -273275, which is a priority document of 
the present application. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram showing the responsivity of a choline monooxygenase (CMO) transit 
peptide to salt stress. 

Fig. 2 is a graph showing betaine accumulation in a transgenic tobacco. 

DETAILED DESCRIPTION OF THE INVENTION 

Hereinbelow, the present invention will be described in detail. 
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The present invention relates to plant choline monooxygenase gene which is induced by 
dry and saline soil. As one example of such a gene, Ckenopodium album choline 
monooxygenase gene will be described. However, it is believed that other choline 
monooxygenase genes inducible by dry and saline soil exist in other plant species which, like 
Ckenopodium album, exhibit tolerance to dry and saline soil. Thus, choline monooxygenase 
genes derived from plants other than Ckenopodium album are also included in the gene of the 
invention. 

The choline monooxygenase of the invention comprises the amino acid sequence shown 
in SEQ ID NO: 2, 4 or 6, However, these amino acid sequences may have some difference 
among plant varieties. Also, even in the same plant variety, the amino acid sequence of choline 
monooxygenase may be varied because of mutations or the like. Accordingly, a protein which 
comprises the amino acid sequence shown in SEQ ID NO: 2, 4 or 6 having deletion, substitution 
or addition of one or several (e.g. one to ten) amino acids, and which has choline 
monooxygenase activity is also included in the present invention. 

Further, the present invention provides a choline monooxygenase gene comprising the 
nucleotide sequence shown in SEQ ID NO; 1 , 3 or 5, However, the gene of the invention is not 
limited to these genes but includes all of the genes encoding the amino acid sequence shown in 
SEQ ID NO: 2, 4 or 6. The gene of the invention also includes all of the genes encoding a 
substantial choline monooxygenase comprising the amino acid sequence shown in SEQ ID NO: 
2, 4 or 6 having substitution, deletion or addition of one or several (e.g. one to ten) amino acids. 

1 . Cloning of the Gene of the Invention 

The gene of the invention can be isolated by extracting RNA from a plant loaded with 
stress (e.g. dehydration or salt treatment) and then subjecting the resultant RNA to RT-PCR. 
Specific examples of plants which may be used as a source of mRNA include, but are not 
limited to, chenopodkceous plants to which Ckenopodium album belongs. The preparation of 
mRNA may be performed by conventional methods. For example, total RNA may be 
extracted from the above-mentioned source by the guanidium thiocyanate-cesium chloride 
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method or the like, and then po!y(A)+ RNA (mRNA) may be obtained therefrom by affinity 
column method using oligo dT-cellulose or poly U-Sepharose or batch method. The poly(A)+ 
RNA may be fractionated further by sucrose gradient centrifugation or the like. Using the thus 
obtained mRNA as a template, single-stranded cDNA is synthesized with oligo dT primers and 
a reverse transcriptase. Then, double-stranded cDNA is synthesized from the single-stranded 
cDNA. As a pair of primers to be used in RT-PCR, oligonucleotides cojresrxmding to two 
portions of other plant's choline monooxygenase which are highly homologous among plant 
choline monooxygenases may be used (e.g. partial sequences from spinach choline 
monooxygenase). 

Further, a cDNA fragment encoding a part of the choline monooxygenase of interest is 
cloned from the cDNA by RT-PCR. From this cDNA fragment, primers for RACE-PCR are 
prepared. Using these primers, RACE-PCR is performed on a template cDNA to which an 
adaptor is ligated at both ends (RACE). Thus, a cDNA encoding the full-length of the choline 
monooxygenase of interest can be obtained. RACE (Rapid Amplification of cDNA Ends) is a 
method for recovering the 5' or 3' missing portion of a cDNA rapidly. 

More specifically, upon determination of the sequence of the partial cDNA fragment 
obtained by RT-PCR, gene specific primers (GSPs) are designed based on the resultant partial 
cDNA sequence. Gene specific primers are primers necessary for amplifying DNA fragments 
which are located at 5' and 3' flanking regions of the above-mentioned partial cDNA sequence 
and whose sequences are unknown. GSP sequences may be selected arbitrarily from the 
above-mentioned partial cDNA sequence. 

Subsequently, DNA fragments located on the 5' side (upstream) and the 3' side 
(downstream) of the above-mentioned partial cDNA are amplified. Although the sequences of 
these DNA fragments which serve as templates are unknown, an adaptor sequence is ligated to 
one end of each fragment. Then, using a primer which hybridizes to the adaptor (termed 
"adaptor primer (AP) H ) and the above-described GSP as a pair of primers, the adaptor-ligated 
cDNA fragment whose sequence is unknown is amplified. 

In the present invention, RACE may be performed using a commercial kit (Marathon™ 
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cDNA Amplification Kit; Clontech), 

The nucleotide sequences of the resultant cDNA fragments are determined by a method 
based on PCR. For example, a reaction is performed using PRISM Sequencing Kit (Perkin 
Elmer) containing fluorescence dideoxy terminator, followed by determination of the nucleotide 
sequence of the resultant product with an autosequenser (e.g. Model ABI 373; Applied 
Biosystems). 

From the thus obtained partial sequence and nucleotide sequences of 5' and 3' RACE 
products, the nucleotide sequence of the full-length cDNA can be obtained by assembling. 
Briefly, by joining these fragments at the sites overlapping 'with each other, the full-length 
nucleotide sequence containing 5' and 3' ends is obtained. 

SEQ ID NOS: 1, 3 and 5 illustrate nucleotide sequences of the gene of the invention. 
SEQ ID NOS: 2, 4 and 6 illustrate amino acid sequences of the choline monooxygenase of the 
invention. However, as long as a protein comprising one of the above amino acid sequences 
has choline monooxygenase activity, the amino acid sequence may have variation (such as 
deletion, substitution or addition) in one or several amino acids. 

For example, 1 to 10 amino acids, preferably 1 to 5 amino acids of the amino acid 
sequence shown in SEQ ID NO: 2, 4 or 6 may be deleted; 1 to 10 amino acids, preferably 1 to 5 
amino acids may be added to the amino acid sequence shown in SEQ ID NO; 2, 4 or 6; or 1 to 
10 amino acids, preferably 1 to 5 amino acids of the amino acid sequence shown in SEQ ID NO: 
2, 4 or 6 may be substituted with other amino acids. 

The term "choline monooxygenase activity" used in the present invention means the 
activity to catalyze the first step oxidization from choline to betaine aldehyde. The presence or 
absence of the above-described activity of the protein of the invention can be confirmed by 
adding to a crude extract of a plant a solution containing choline chloride and DCPIP, and 
measuring changes in absorbance in the resultant reaction solution (Japanese Unexamined 
Patent Publication No. 10-191983). 

Further, a DNA which hybridizes to the above-described gene under stringent conditions 
is also included in the gene of the invention. The "stringent conditions" means those conditions 
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under which the so-called specific hybrid is formed but non-specific hybrids are not formed. 
For example, those conditions under which highly homologous DNAs (i.e. DNAs having 60% 
homology or more, preferably 80% homology or more) hybridize to each other and DNAs with 
less homology do not hybridize to each other may be given. More specifically, stringent 
conditions means a sodium concentration of 150-900 mM, preferably 600-900 mM, and a 
temperature of 60-68 °C, preferably 65 °C. 

When genes consisting of the nucleotide sequences shown hi SEQ ID NOS: 1, 3 and 5, 
respectively, are designated type A, type B and type C, there are 97.0% homology between type 
A and type B, 98.2% homology between type A and type C, and 97.5% homology between type 
B and type C. Therefore, a gene which comprises a nucleotide sequence having 90%, 
preferably 97% homology or more to type A gene and which encodes a protein having choline 
monooxygenase activity is also included in the gene of the invention. 

Once the nucleotide sequence of the gene of the invention has been deteirained, the gene 
of the invention can be obtained by chemical synthesis, by PCR using the cloned cDNA as a 
template, or by hybridization using a DNA fragment having the nucleotide sequence as a probe. 
Further, a modified DNA encoding the choline monooxygenase may be synthesized by site 
specific mutagenesis or other techniques. 

In order to introduce mutations into genes, known techniques such as the method of 
Kunkel, the gapped duplex method, etc. or techniques based on these methods may be used. 
For example, mutations may be introduced using a mutation introduction kit (e.g. Mutant-K or 
Mutant-G both from Takara) utilizing site specific mutagenesis or LA PCR in vitro Mutagenesis 
series kits (Takara). 

2. Preparation of Recombinant Vectors and Transformants 
( 1 ) Preparation of Recombinant Vectors 

The recombinant vector of the invention can be obtained by ligating (inserting) the 
gene of the invention to (into) an appropriate vector. The vector into which the gene of the 
invention is to be inserted is not particularly limited as long as it is replicable in a host. For 
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example, plasmid DNA, phage DNA or the like may be used. 

Specific examples of plasmid DNA include R co/i-derived plasmid s (e.g. pBR322, 
pBR325, pUC118, pUC119, etc.), Bacillus subtilis-denved plasmids (e.g. pUBHO, pTPS, etc.) 
and yeast-derived plasmids (e.g. YEpl3, YEp24, YCp50, etc.), and specific examples of phage 
DNA include X phage (Charon4A, Charon2lA, EMBL3, EMBL4, AgtlO, JLgtll. X ZAP, 
etc.). Further, an animal virus vector such as retrovirus or vaccinia virus; or an insect virus 
vector such as baculovirus may also be used. 

For insertion of the gene of the invention into a vector, a method may be employed in 
which the purified DNA is digested with appropriate restriction enzymes and then inserted into 
the restriction site or the multi-cloning site of an appropriate vector DNA for ligation. 

The gene of the invention should be incorporated into the vector so that the function 
thereof can be manifested. For this purpose, the vector of the invention may contain, if 
desired, cis elements such as an enhancer, splicing signals, poly(A) addition signals, selection 
markers, ribosome binding sequences (SD sequences) or the like in addition to a promoter and 
the gene of the invention. As the selection marker, dihydrofolate reductase gene, ampicillin 
resistance gene, neomycin resistance gene or the like may be enumerated. 
(2) Preparation of Transformants 

The transformant of the invention can be obtained by introducing the recombinant 
vector of the invention into a host so that the gene of interest can be expressed. The host useful 
in the invention is not particularly limited as long as it can express the DNA of the invention. 
Specific examples of the host include Escherichia bacteria such as Escherichia coli; Bacillus 
bacteria such as Bacillus subiilis; Psendomonas bacteria such as Pseudomonas putida; 
Rhizobium bacteria such as Rhizobium meliloti; yeast such as Saccharomyces cerevisiae, 
Schizosaccharomyces pombe; animal cells such as COS cells, CHO cells; or insect cells such as 
Sf9 cells. 

When a bacterium such as E. coli is used as the host, the recombinant vector of the 
invention should be capable of autonomous replication in this microorganism and, at the same 
time, it is preferred that the vector be composed of a promoter, a ribosome binding sequence, the 
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gene of the invention and a transcription termination sequence. The vector may also contain a 
gene(s) to control the promoter. 

As Escherichia bacteria, E. coli DH5 a or Y 1090 strain may be used, for example. As 
Bacillus bacteria, Bacillus subtilis may be used, for example. However, the present invention 
is not limited to these bacteria. 

As the promoter, any promoter may be used as long as it can direct the expression of the 
gene of interest ra a host such as E. coli . For example, an E. coli- or phage-derived promoter 
such as trp promoter, lac promoter, P L promoter or P R promoter may be used. An artificially 
designed and altered promoter such as tac promoter may also be used. 

As a method for introducing the recombinant vector into a bacterium, any method of 
DNA transfer into bacteria may be used. For example, a method using calcium ions [Cohen, 
S.N. et al., Proc. Natl. Acad. ScL, USA, 69:2110-2114 (1972)], electropoxation or the like may 
be used. 

When yeast is used as the host, Saccharomyces cerevisiae^ Schizosaccharomyces pombe, 
Pichia pastoris or the like may be used. In this case, the promoter to be used is not particularly 
limited. Any promoter may be used as long as it can direct the expression of the gene of 
interest in yeast. For example, gall promoter, gall 0 promoter, heat shock protein promoter, 
MF a 1 promoter, PH05 promoter, PGK promoter, GAP promoter, ADH promoter, AOXl 
promoter or the like may be used. 

As a method for introducing the recombinant vector into yeast, any method of DNA 
transfer into yeast may be used. For example, electroporation [Becker, D.M., Methods 
Enzymol., 194:182-187 (1990)], the spheroplast method [Hinnen, A. et al., Proc. Natl. Acad. 
Sci., USA, 75:1929-1933 (197$)], the lithium acetate method [Itoh, H., J. Bacterid., 153:163- 
168 (1983)] or the like may be employed. 

When an animal cell is used as the host, simian COS-7 or Vero cells, Chinese hamster 
ovary cells (CHO cells), mouse L cells, rat GH3 cells, human FL cells or the like may be used. 
As a promoter, SR a promoter, SV40 promoter, LTR promoter, CMV promoter or the like may 
be used. The early gene promoter of human cytomegalovirus may also be used. As a method 
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for introducing the recombinant vector into the animal cell, electroporation, the calcium 
phosphate method, lipofection or the like may be employed. 

When an insect cell is used as the host, Sf9 cells or the like may he used. As a method 
for introducing the recombinant vector into the insect cell, the calcium phosphate method, 
lipofection, electroporation or the Hke may be employed. 

When a plant is used as the host, a transformant may be prepared as described below. 

In the present invention, a plant to be transformed may be any of the following plant 
materials: entire plant bodies, plant organs (e.g. leaves, petals, stems, roots, seeds, etc.), plant 
tissues (e.g. epidermis, phloem, parenchyma, xylene vascular bundles, palisade tissues, spongy 
tissues, etc.) or cultured plant cells. Specific examples of plant species which may be used for 
transformation include, but are not limited to, those belonging to the genus Chenopodiaceae, 
Solanaceae, Gramineae> Leguminosae, Rosaceae, Compositae, Liliaceae, Caryophyllaceae, 
Cucurbitaceae, Convolvulaceae or Cruciferae. 

The above-described recombinant vector may be introduced into a plant by conventional 
transformation methods, e.g. the Agrobacterium method, the particle gun method, PEG method, 
electroporation, etc. For example, when the Agrobacterium method is used, a plant expression 
vector constructed is transferred into an appropriate Agrobacterium strain (e.g. Agrobacterium 
tumefaciens LBA4404), followed by infection of aseptically cultured leaf discs of a host (e.g. 
tobacco) with this strain according to the leaf disc method (Hhobumi Uchimiya, Operation 
Manual for Plant Genes, 1 990, pp. 27-3 1, Kohdansha Scientific Co., Ltd., Tokyo). Thus, a 
transformed tobacco is obtained. 

When the particle gun method is used, entire plant bodies, plant organs or plant tissues 
may be used as they are, or may be used after preparation of pieces or protoplasts. The thus 
prepared samples may be bombarded using a gene transfer apparatus (e.g. PDS-1000; BioRad). 
Bombardment conditions vary depending on the type of the plant or sample. Usually, the 
sample is bombarded under a pressure of about 450-2000 psi and at a distance of 4-12 cm. 

When a cultured plant cell is used as the host, transformation is performed by introducing 
the recombinant vector thereinto by the particle gun method, electroporation or the like. 
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Tumor tissues, shoots, haiiy roots, etc. resulted from the traasforraation can be used 
directly in cell culture, tissue culture or organ culture* Further, they can be regenerated to plant 
bodies by using conventional plant tissue culture methods and administering plant hormones 
(e.g. auxin, cytokmin, gibberellin, abscicic acid, ethylene, brasinolide) at appropriate 
concentrations. 

Whether the gene of interest has been integrated into the host or not can be confirmed by 
PCR, Southern hybridization, Northern hybridization or the like. For example, DNA is 
prepared from the transformant and then DNA specific primers are designed for PCR. A PCR 
reaction may be performed under the same conditions as described above in the preparation of 
plasmids. Subsequently the amplified product is subj ected to agarose gel electrophoresis, 
polyacrylamide gel electrophoresis or capillary electrophoresis and stained with etbidium 
bromide, SYBR Green solution, etc. By detecting the amplified product as a single band, it 
can be confirmed that the host has been transformed. Alternatively, a PCR reaction may be 
performed using primers labelled with a fluorescent dye or the like, and then the amplified 
product may be detected. Further, a method may be employed in which a PCR amplified 
product is bound to a solid phase such as a microplate, and then the product is confirmed by 
fluorescence or enzyme reaction. 

3 . Production of the Protein of the Invention 

The protein of the invention is a protein comprising the amino acid sequence encoded by 
the choline monooxygenase gene of the invention; or a protein which comprises the above 
amino acid sequence having the above-described mutation in a plurality of amino acids and yet 
which has choline monooxygenase activity. In this specification, the protein of the invention is 
sometimes called the "choline monooxygenase protein". 

The choline monooxygenase protein of the invention can be obtained by culturing the 
above-described transformant in a medium and recovering the protein from the resultant culture. 
The term "culture" means any of the following materials: culture supernatant, cultured cells or 
microorganisms, or disrupted cells or microorganisms. 
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The culturing of the transformant of the invention is carried out by conventional methods 
used for culturing hosts. 

As a medium for culturing the transformant obtained from a microorganism host such as 
E. coli or yeast, either a natural or synthetic medium may be used as long as it contains carbon 
sources, nitrogen sources and inorganic salts assimilable by the microorganism and is capable of 
efficient culturing of the transformant. 

As carbon sources, carbohydrates such as glucose, fructose, sucrose, starch; organic acids 
such as acetic acid, propionic acid; and alcohols such as ethanol, propanol may be used. 

As nitrogen sources, ammonium salts of inorganic or organic acids (e.g. ammonia, 
ammonium chloride, ammonium sulfate, ammonium acetate, ammonium phosphate, etc.) and 
other nitrogen-containing compounds (e.g. Peptone, meat extract, corn steep liquor, etc) may be 
used. 

As inorganic substances, potassium dihydrogen phosphate, dipotassium hydrogen 
phosphate, magnesium phosphate, magnesium sulfete, sodium chloride, iron(II) sulfate, 
manganese sulfate, copper sulfate, calcium carbonate and the like may be used. 

Usually, the culture is carried out under aerobic conditions used in shaking <nilture or 
aeration agitation culture, at 37^. Adjustment of the pH of the medium is carried out with an 
inorganic or organic acid, an alkali solution or the like. 

During the culture, antibiotics such as ampicillin or tetracycline may be added to the 
medium if necessary. 

When a microorganism transformed with an expression vector containing an inducible 
promoter is cultured, an inducer may be added to the medium if necessary. For example, when 
a microorganism transformed with an expression vector containing Lac promoter is cultured, 
isopropyl- B -D-thiogalactopyranoside (IPTG) or the like may be added. When a 
microorganism transformed with an expression vector containing tip promoter is cultured, 
indoleacetic acid (IAA) or the like may be added. 

As a medium for culturing the transformant obtained from an animal cell as a host, 
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commonly used RPMI1640 medium or DMEM medium, or one of these media supplemented 
with fetal calf serum, etc. may be used. Usually, the culture of such a transformant is carried 
out in the presence 5% CO2 at 37°C for 1 to 30 days. During the culture, antibiotics such as 
kanamycin or penicillin may be added to the medium if necessary. 

After the culture, the choline monooxygenase protein of the invention is extracted by 
disrupting the cultured microorganisms or cells if the protein is produced in the microorganisms 
or cells. If the protein of the invention is produced outside of the microorganisms or cells, the 
culture fluid is used as it is or subjected to centrifugation to remove the microorganisms or cells. 
Thereafter, the resultant supernatant is subjected to conventional biochemical techniques used 
for isolating/purifyinng a protein. These techniques include ammonium sulfate precipitation, 
gel chromatography, ion exchange chromatography and affinity chromatography; these 
techniques may be used independently or in an appropriate combination to thereby isolate and 
purify the protein of the invention from the above culture. 

When the host is a plant, the choline monooxygenase of the invention can be produced by 
culturing or cultivating the transformed plant. Further, it is also possible to produce the 
product of a reaction catalyzed by the choline monooxygenase, or the intermediates and/or the 
final product (e.g. betaine) of a series of biosynthesis reactions following the above reaction. 

When the transfoxmant is a plant cell or plant tissue, culture may be performed in a 
conventional plant culture medium, e.g. MS basal medium (Murasnige, T. & Skoog. R (1962) 
Physiol. Plant. 15: 473), LS basal medium (Linsmaier, E.M. & Skoog, F. (1965) Physiol. Plant 
1 8: 1 00), or protoplast culture medium (modified LS medium). Conventional solid culture 
methods may be used, but it is preferable to use liquid culture methods. 

A transfornied plant cell, tissue or organ is inoculated into the above medium at a rate of 
0. 1 -2 .0 g fresh weight/liter. If necessary, NAA, 2.4-D, B A, kinetin or the like is added to the 
medium appropriately. Then, the transformant is cultured. The pH of the medium at the start 
of culture is adjusted to 5-7. Usually, the culture is performed at 20-30 Q C, preferably at 
around 25 under aeration at 0.2-1 wm and agitation at 50-200 rpm for 1-6 weeks. 

When the transformant is a plant body, it may be cultivated on a field or in a glass house 
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or may be hydroponically cultured. 

In order to recover the protein of the invention from cultured cell or tissues, first, the cells 
are disrupted by cell lysis treatment using an eii2yme such as cellulase or pectinase, sonication, 
grinding or the like. Then, insoluble matters are removed therefrom by filtration, 
centrifugation, etc. to thereby obtain a crude protein solution or a solution containing the 
primary and/or the secondary metabolite of the plant. 

In order to further purify the protein of the invention from the above crude protein 
solution, conventional protein purification methods may be used. For example, ammonium 
sulfate salting out, ion exchange chromatography, hydrophobic chromatography, gel filtration 
chromatography, affinity chromatography or electrophoresis may be used independently or in an 
appropriate combination. 

In order to recover the protein of the invention from plant organs or plant bodies, first, an 
extract of the useful substance is prepared by disrupting the plant organs or bodies by sonication 
or grinding. Subsequently, the above-described purification procedures may be followed. 

4. Transit Peptides 

(1 ) Specification of Transit Peptide Sequences 

The transit peptide of the invention is a peptide comprising the amino acid sequence 
shown in SEQ ID NO; 17. The location of this amino acid sequence can be specified by 
sequence analysis of the cloned choline monooxygenase (CMO) gene. This amino acid 
sequence is encoded by the nucleotide sequence shown in SEQ ID NO: 16. 

Once the amino acid sequence has been known, the transit peptide of the invention may 
be produced by chemical synthesis as described below, 

(2) Chemical Synthesis of Transit Peptides 

The transit peptide of the invention may be produced by conventional peptide synthesis 
techniques based on the amino acid sequence specified as described above. Either the liquid 
synthesis method or the solid synthesis method may be used. Such peptide synthesis may be 
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performed by any of the known methods (see, for example, Bodanszky, M. and M.A. Ondetti, 
Peptide Synthesis, Interscience Publishers, New York (1966); Schroeder and Luebke, The 
Peptide, Academic Press, New York (1965); KM. Finn and K. Hofmann, The Proteins, Vol. 2, 
H. Nenrath and R, L. Hill (eds.), Academic Press Inc., New York (1976); N. Izumiya et al„ 
Basics and Experiments in Peptide Synthesis, Maruzen Co., Tokyo (1985); H. Yajima, S. 
Sakakibara et al, Course of Biochemistry Experiment Lectures No. 1, The Japanese 
Biochemical Society (ed.), Tokyo Kagaku Dojin Co., Tokyo 1977; T. Kimura, 2nd Series: 
Course of Biochemistry Experiment Lectures No. % The Japanese Biochemical Society (ed.), 
Tokyo Kagaku Dojin Co., Tokyo 1987). Thus, the transit peptide of the invention can be 
obtained by, for example, the azide method, the acid chloride method, the acid anhydride 
method, the mixed acid anhydride method, the DCC method, the active ester method, the 
method using Woodward's reagent K, the carbonylimidazole method, the oxidation-reduction 
method, the DCC/HONB method, or the method using BOP reagent. Usually, the transit 
peptide may be synthesized with a commercial, automated peptide synthesizer. 

The transit peptide of the invention can be prepared by ligating a peptide of interest to a 
peptide fragment of the transit peptide by condensation and then removing the protecting groups 
of the C-tenninal a -carboxyl and N-terminal a -amino groups of the resultant product at the 
same time or in a stepwise manner. 

After completion of the reaction, the thus prepared peptide can be recovered by a 
combination of peptide separation/purification techniques such as solvent extraction, distillation, 
partition, reprecipitation, recrystallization, column chromatography, high performance liquid 
chromatography, gel filtration, ion exchange chromatography and ion exchange 
chromatography. 

The transit peptide of the invention may be obtained in the form of a metal salt; a salt 
made of the peptide and a base or basic compound; an inorganic acid addition salt; an organic 
salt; or the like. In particular, the transit peptide of the invention can be obtained as a 
pharmaceutically acceptable acid addition salt (e.g. a salt made of the peptide and an inorganic 
or organic acid). Specific examples of acid addition salts include salts made of the peptide and 
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inorganic acids such as hydrochloric acid, phosphoric acid, hydrobromic acid, sulfuric acid; or 
salts made of the peptide and organic acids such as acetic acid, formic acid, propionic acid, 
fumaxic acid, maleic acid, succinic acid, tartaric acid, citric acid, malic acid, oxalic acid, benzoic 
acid, methanesulfonic acid, benzenesulfonic acid. Specific examples of basic salts include 
salts made of the peptide and inorganic bases such as sodium hydroxide, potassium hydroxide, 
ammonium hydroxide, magnesium hydroxide; or salts made of the peptide and organic bases 
such as cafifeine, piperidine, tximemylamine, pyridine. Specific examples of metal salts include 
sodium salts, potassium salts, calcium salts and magnesium salts. 

The salt of the transit peptide of the invention can be prepared using an appropriate acid 
such as hydrochloric acid or an appropriate base such as sodium hydroxide. For example, the 
peptide may be treated with such an acid or base in water or a liquid containing an inactive, 
water-rniscible organic solvent such as methanol, ethanol or dioxane according to a standard 
protocol to thereby prepare a salt. The treatment temperature may be from 0 to 100°C; room 
temperature is preferable. 

The biochemical and physicochemical "properties of the transit peptide of the invention 
can be analyzed by mass spectrometric analysis, nuclear magnetic resonance, electrophoresis, 
high performance liquid chromatography, etc. 

5 . Construction of a Complex Composed of Gene of Interest and Transit 
Peptide Gene, and Induction of Substance Accumulation 

In the present invention, a gene encoding the transit peptide (i.e. peptide having a function 
as a signal peptide) of the invention is ligated upstream of a gene of interest. A DNA complex 
can be obtained by digesting the gene of interest and the transit peptide gene with appropriate 
restriction enzymes and then ligating these two genes to each other using ligase. The thus 
ligated DNA is ligated to a vector predigested with appropriate restriction enzymes to thereby 
obtain a recombinant vector. The resultant vector is introduced into a host to thereby obtain a 
transformant. By culturing or cultivating the resultant transformant, an expression product of 
the gene of interest or a metabolite generated through the metabolism of the above expression 
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product by the host can be accumulated. Construction of the recombinant vector, selection of 
the host and transformation may be performed in the same manner as described above for the 
choline monooxygenase of the invention. 

As the gene of interest, a gene encoding a polypeptide or enzyme may be given, but other 
genes may also be used. When the expression product of interest is an enzyme, the product of 
a reaction catalyzed by the enzyme or the intermediates and/or the final product (primary or 
secondary metabolite of plants) of a series of biosynthesis reactions following the above 
reaction may be accumulated as useful substances. Specific examples of such useful 
substances include a substance that confers environmental stress resistance, e.g. betaine. 
Further, when the gene of interest is a control gene (also called "master gene") in charge of the 
functional regulation of an entire reaction pathway (such as biosynthesis pathway) in 
mechanisms for transducing signals from phosphatases or G-protem, the intermediates and/or 
final product of the reaction pathway located downstream of the signal transduction of the above 
control gene may also be accumulated as useful substances. These substances may or may not 
be involved in environmental stress resistance. 

When the transformant (in particular, transformed plant) thus obtained is cultured or 
cultivated under environmental stress conditions, an expression product of the gene of interest or 
a plant metabolite from the expression product is accumulated in the plant upon receipt of the 
environmental stress as a signal. Specific examples of environmental stresses include salt 
stress, dehydration stress, low temperature stress and high temperature stress. 

Salt stress is loaded by adding sodium chloride to a specific hydroponic solution to give a 
concentration of 50-600 mM and culturing the transformed plant therein under usual conditions. 

Dehydration stress is loaded by vdmdrawing the entire plant body from the soil or 
hydroponic solution and exposing it to the air, or by adding polyethylene glycol or the like to the 
hydroponic solution or medium. 

High temperature or low temperature stress is loaded by rising or lowering the 
temperature of the incubator, green house, etc. in which the transformed plant is cultured or 
cultivated. 
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After loaded with stress, the plant is cultured or cultivated in the same manner as 
described earlier for the culturing of the transformant, to thereby obtain an environmental stress 
resistant plant. The term "environmental stress resistant" means the state of a plant that does 
not wither even under conditions which wither non-resistant plants or that is able to grow even 
under conditions which terminate the growth of non-resistant plants, when a particular stress 
(e.g. salt stress, dehydration stress) has been loaded. 

In the present invention, it is possible to allow the expression of a choline 
monooxygenase-transit peptide complex by Hgating a DNA encoding a transit peptide (SEQ ID 
NO: 16) to a DNA encoding choline monooxygenase (SEQ ID NO: 1, 3 or 5) as a gene of 
interest and incorporating the resultant construct into an expression vector. In this case, a 
choline monooxygenase gene with a DNA encoding a transit peptide is incorporated in the 
environmental stress resistant plant of the invention. When this plant is cultivated under an 
environmental stress as described above, accumulation of choline monooxygenase is induced. 
As a result, synthesis of betaine aldehyde from choline is catalyzed and, finally, betaine is 
accumulated in the plant. Such accumulation of betaine is significant in a sense that it can 
confer environmental stress resistance on the plant For recovering betaine from plants, 
methods for purifying quaternary ammonium compounds may be employed. 



PREFERRED EMBODIMENTS OF THE INVENTION 



Hereinbelow, the present invention will be described more specifically with reference to 
the following Examples. However, the technical scope of the invention is not limited to these 
Examples. In the Examples, choline monooxygenase is expressed as "CMO" or "CMO 
protein" and choline monooxygenase gene as "cmo" or "cmo gene". 



EXAMPLE 1 
Preparation of RNA 
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Chenopodium album mature leaves (4 g) immediately after harvest were disrupted in 
liquid nitrogen with a blender. To the disrupted leaves, 20 ml of a guanidine thiocyanate 
solution (4.2 M guanidme thiocyanate, 25 mM sodium citrate dihydrate; immediately before 
use, 7 ix 1 of 2-mercaptoethanol and 5 mg of sodium lauroyl sarcosinate are added per milliliter 
of the solution) was added and shaken vigorously for 1 0 ruin at room temperature. The 
resultant mixture was centrifuged at 1 0,000 rpm for 10 inin to obtain a supernatant, to which 1 g 
of CsCl was added per 2 ml. This supernatant (6-7 ml) was overlayered upon 4 ml of 5.7 M 
CsCl solution (5.7 M CsCl, 0.1 M EDTA (pH 7.5)) contained in an polyalomer tube and ultra- 
centrifuged at 35,000 rpm at 20 X> for 1 8 hr. 

The resultant precipitate was dissolved completely in 5 ml of Tris-SDS solution (50 mM 
Tris-HCl (pH 9.0), 1 % SDS). To this solution, 5 ml of phenol (pH 9.0) was added and shaken 
at room temperature for 1 0 min. Then, the resultant solution was centrifuged at 5,000 rpm at 
20°C for 10 rnin. To the resultant supernatant, 5 ml of phenol/chloroform was added and 
shaken at room temperature for 1 0 min. Then, the resultant solution was centrifuged at 5,000 
rpm at 20 X} for 1 0 min. To the resultant supernatant, 5 ml of chloroform was added and 
shaken at room temperature for 1 0 min. Then, the resultant solution was centrifuged at 5,000 
rpm at 20"^ for 1 0 min. To the resultant supernatant, 1/10 volume of 3 M NaOAc was added. 
Then, 2 volumes of EtOH was added thereto and mixed. The mixture was left at -20°C for 30 
min, followed by centrifugation at 10,000 rpmat4°C for 10 min. The resultant precipitate 
was dissolved in 1 ml of H2O to thereby obtain RNA from Chenopodium album mature leaves. 

EXAMPLE 2 
Acquisition of a Gene Fragment by RT-PCR 
and Cloning of the Full-Length Gene 

(1) Acquisition of a Gene Fragment by RT-PCR 

A gene fragment was obtained using RT-PCR Kit (Stratagene) according to the protocol 
attached to the kit basically. PCR primers were designed based on the amino acid sequence of 
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spinach. CMO. 

Chenopodium album RNA (9.5 \i g) was dissolved in 38 p. 1 of DEPC-treated H 2 0. 
Three microliters of random primer (100 ng/ y. 1) was added thereto, followed by incubation at 
65°C for 5 min. Then, the solution was cooled slowly to room temperature. To this solution, 
5 fi 1 of 1 Ox 1 st strand buffer; 1 u 1 of RNase Block Ribomiclease Inhibitor (40 U/ n 1), 2 p,l 
of 1 00 mM dNTPs and 1 u 1 of MMLV reverse transcriptase (50 U/ a 1) were added and reacted 
at 37 °C for 1 nr. After the reaction, the reaction solution was incubated at 90 °C for 5 min 
and then placed on ice. Using 5 /z 1 of the thus obtained 1 st strand cDNA solution as a 
template, the following PCR reaction solution was prepared and incubated at 91 °C for 5 min 
and at 54 a C for 5 min. 

Composition of the PCR Reaction Solution: 



1st strand cDNA solution 5 jul 

1 Ox Ex Taq buffer (Takara) 10 /xl 

dNTPs mix (2.5 mM each) 8 jul 

Primer 1 (SEQ ID NO: 7) 1 00 pmol 

Primer 7. (SF.0 TP NO: 8) 100 pmol 

Total volume 99.5 #1 



Subsequently, 0.5 p. 1 of Takara Ex Taq DNA polymerase (5 U/ n 1) was added to the 
reaction solution. Then, a PCR reaction was performed 30 cycles, one cycle consisting of 
denaturation at 91 X> for 1 min, annealing at 54 ^ for 1 min and extension at 72t for 2 
mm. After the reaction, the reaction solution was subj ected to agarose gel electrophoresis. A 
fragment of approx. 600 bp which was believed to be the product of interest was cut out from 
the gel, purified, cloned into pT7Blue T-Vector (Novagen) and sequenced to thereby obtain a 
Chenopodium album cmo gene fragment. 

(2) Purification of mRNA 

Purification of mRNA was performed using mRNA Purification Kit (Pharmacia) and 
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according to the protocol attached to the kit. Briefly, 0.9 mg of total RNA from Chenopodium 
album was dissolved in 1 ml of an elution buffer. The solution was heated at 65 t for 5 min 
and then immediately ice-cooled. To this solution, 0.2 ml of a sample buffer was added, and 
the resultant solution was applied to an oligo (dT)-celIulose spin column pre-equilibrated with a 
high-salt buffer. After elution, the column was centrifuged at 3 50xg for 2 mm. Subsequently, 
the column was washed by adding thereto 0.25 ml of a high-salt buffer and centrifuging at 
350xg for 2 min; these washing operations were performed twice. Then, similar washing 
operations were performed 3 times using 0.25 ml of a low-salt buffer. 

RNA was recovered by repeating the following operations 4 times: addition of 0.25 ml of 
an elution buffer preheated to 65"C and centrifugation at 350xg for 2 min. The resultant RNA 
solution (1 ml) was column-purified again in the same manner as described above. To 1 ml of 
the resultant RNA solution, 100 ti 1 of a sample buffer, 10 fi 1 of a glycogen solution and 2.5 
ml of EtOH were added, and the resultant solution was left at -20^ for 2 hr. Then, the 
solution was centrifuged at 14,000 rpm at 4 'C for 10 min. The precipitate was dissolved in 
20 fi 1 of H 2 0, followed by the determination of absorbance. 

Thus, 21 MgofmRNA was obtained. 

(3) Synthesis of cDNA 

(3-1) Synthesis of 1st Strand cDNA 

Water was added to 1 (J. g of Chenopodium album mRNA and 1 u 1 of cDNA synthesis 
primer (10 juM)togivea5 \x 1 solution, which was heated at 70*C for 2 min and immediately 
cooled on ice for 2 min. To this solution, 2 fi 1 of 5x 1 st strand buffer, 1 it 1 of dNTPs mix 
(10 mM), 1 fi 1 of MMLV reverse transcriptase (1 00 U/ a 1) and 1 n 1 of H 2 0 were added. 
After heating at 42t for 1 hr, the solution was immediately cooled on ice. 

(3-2) Synthesis of 2nd Strand cDNA 

Ten microliters of the 1st strand reaction solution, 16 n 1 of 5x 2nd strand buffer, 1.6 fx 1 
ofdNTPsmix(10mM),4 fi 1 of 20x 2nd strand enzyme cocktail and 48.4 julofH 2 Owere 
mixed gently on ice, and the mixture was incubated at 16°C for 45 min. Then, the reaction 
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was terminated by adding thereto 4 it 1 of a mixture of EDTA and glycogen. To the reaction 
solution, 100 u 1 of a mixture of phenol:chloroform:isoamyl alcohol (25:24:1) was added and 
vortexed. Then, the solution was centrifuged at 14,000 rpm for 1 0 min. To the resultant 
supernatant, 100 ti 1 of a mixture of phenol:isoamyl alcohol (24:1) was added and vortexed, 
followed by centrifugation in the same manner as described above. To the resultant 
supernatant, 1/2 volume of 4 M ammonium acetate and 2.5 volumes of EtOH were added, 
followed by centrifugation at 14,000 rpm for 20 min. The precipitate was rinsed with 80% 
EtOH, vacuum-dried and then dissolved in 1 0 \i 1 of H 2 0. Two microliters of this solution 
was subj ected to 0.8% agarose gel electrophoresis for confirmation. Thus, double-stranded 
(ds) cDNA was obtained. 

To 5 //lofthedscDNA,2 il 1 of Marathon cDNA adaptor (10 //M),2 #lof5xDNA 
ligation buffer and 1 1 # 1 of T4 DNA ligase (1 U/ jj, 1) were added arid incubated at 1 6 
overnight After deactivation of the ligase by heating at 70°C for 5 min, 1 jti 1 of the reaction 
solution was diluted with 250 ix 1 of Tricine-EDTA buffer. The diluted solution was heated at 
94^ for 2 min and then cooled on ice for 2 min, to thereby obtain an adaptor-ligated cDNA for 
use m PACE PCfL 

(4) 5'and3'RACE-PCR 

On 5 m 1 of the adaptor-ligated cDNA as a template, a PGR was performed using 
Advantage Klen Taq polymerase (Clontech). Then, 5 a I of the reaction solution was 
subjected to 0.8% agarose gel electrophoresis for confirmation of the amplified product. 

For 5' RACE-PCR, primer 3 (SEQ ID NO: 9) was used. For 3' RACE-PCR, primer 4 
(SEQ ID NO: 10) was used. 

Composition of the PGR Reaction Solution: 



H 2 0 

lOmM dNTPsmix 

5 Ox Klen Taq polymerase mix 

1 Ox Klen Taq buffer (Clontech) 



36 ix 1 
1 nl 
1 jil 
5 ti\ 
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Adaptor-ligated cDNA 

10 ti M API primer (Clontech) 

10 n M Primer (primer 3 or primer 4") 



5 Ml 
1 



Total: 50 n 



PCR conditions were as follows: first denaturation at 94 °C for 1 min, then 5 cycles of 
reaction at 94 °C for 30 sec and at 72 *C for 4 mim then 5 cycle of reaction at 94°C for 30 sec 
and at 70 °C for 4 rnin; and finally 25 cycles of reaction at 94 for 30 sec and at 68 t for 



An approx. 1 .3 kbp band which was believed to be the 5' RACE product and an approx. 
1 .2 kbp band which was believed to be the 3 ' RACE product were comirmed. Each of these 
bands was cut out from the agarose gel, purified and cloned into pTTBlue T-vector. The 
nucleotide sequence of each clone was determined by the dye^erminator method and analyzed. 
As a result, it was found that three cmo genes of type A (SEQ ID NO: 1), type B (SEQ ID NO: 
3) and type C (SEQ ID NO r 5) exist. 

The amino acid sequences encoded by type A, type B and type C genes are shown in SEQ 
ID NOS: 2, 4 and 6, respectively. 

(5) Acquisition of the Full-Length Type C cmo Gene 

Among the three cmo genes, the type C gene was selected for future analysis. Then, the 
inventors isolated the full-length nucleotide sequence of type C cmo gene. 

Briefly, a PCR reaction was performed using a Smal site-added primer (primer 5)(SEQ H> 
NO: 11), an Xbal site-added primer (primer 6)(SEQ ID NO: 12) and KOD DNA polymerase 
(Toyobo), and the amplified product was ligated to pT7Blue T-vector (Novagen). The PCR 
was performed 30 cycles, one cycle consisting of denaturation at 94 °C for 1 min, annealing at 
60 °C for 1 min and extension at 72°C for 2 min. 
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Composition of the PCR Reaction Solution: 



Primer concentration 20 pmol 



2 fflM dNTPs 
Adaptor-Hgated cDNA 
25mMMgCl 2 

KOD DNA polymerase (2.5 U/ ju l)(Toyobo) 
lOx PCR buffer fTovobo') 



5 



Ml 
III 

jlL 



10 



2 



1 



5_ 



Total: 50 \i 



The nucleotide sequence of the amplified product was determined by the dye^terminator 
method to thereby confirm that the product was type C gene. This was designated "pTTcmo". 

(6) Preparation of Antibodies to CMO Protein 

In order to analyze the expression of CMO protein, antibodies to CMO protein were 
prepared using Xpress System (Invitrogen). 

Briefly, 5' primer (primer 7)(SEQ ID NO: 13) to which BamHI site was added and V 
primer (primer 8)(SEQ ID NO: 14) to which a Kpnl site was added were prepared for 
amplifying the code region of the protein excluding the transit peptide. Using these primers,, a 
PCR was performed to thereby amplify an approx. 1 .2 kbp fragment. The PCR was performed 
30 cycles, one cycle consisting of denaturation at 94°C for 1 min, annealing at 60 for 1 min 
and extension at 72X: for 2 min. 



Composition of the PCR Reaction Solution: 



H 2 0 


78 


Ml 


4 mM dNTPs mix 


8 


fJ-l 


Ex Taq (5 U/ n 1) (Takara) 


0.5 


H\ 


1 Ox Ex Taq buffer (Takara) 


10 


111 


pT7cmo(l ng///l) 


1 


Ml 


10 u M Primer (primer 7 + primer 8) 







Total: 100 ju 1 
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The amplified product was ligated to pT7Blue T-vector (Novagen) and designated 
"pT7cmoA". pT7cmoA and pTrcHis were digested with restriction enzymes BamHI and Kpnl, 
and separately subjected to 0.8% agarose gel electrophoresis. Using Gene Clean Spin Kit (BIO 
101), a cmo gene fragment of approx. 1.2 kbp and a vector fragment of approx. 4.4 kb were 
recovered from the gel according to the manual attached to the kit. After purification, these 
two fragments were ligated to each other in a reaction system of 50 v 1, using DNA Ligation 
Kit (Takara) utilizing T4 DNA ligase, and the thus ligated DNA is referred to as pTHC. The 
ligated DNA was introduced into E. coli (JM 109; Takara) to thereby prepare a fusion protein 
expression vector pTHC- 

The thus prepared pTHC was purified by the miniprep method and introduced into E. coli 
(TOP 1 0; Invitrogen) according to the manual attached to TOP 1 0. The pTHC-introduced E. 
coli was cultured according to the manual attached to Xpress System (Invitrogen). From 400 
ml of the resultant culture liquid, histidine-labelled CMO protein was prepared and applied to a 
Probond resin column (Invitrogen). Thus, approx. 1 0 mg of the protein was immobilized in the 
column. From this column, 350 mM imidazole fraction was recovered, followed by removal 
of imidazole with PB- 1 0 (Pharmacia). The resultant solution was applied to a Probond resin 
column (Invitrogen) again to immobilize the protein. Two hundred units of enterokinase was 
mixed with approx. 4.5 mg of the immobilized fusion protein and shaken at room temperature 
for 10 hr. In 4 ml of the eluate from this column, the presence of a 43 kDa protein (equivalent 
to 1.6 mg of CMO mature protein) was confirmed by SDS-PAOE. 

After purifications this CMO protein was administered to rabbits as antigen to prepare 
anti-sera, which were used as antibodies. 

EXAMPLE 3 
Construction of Expression Vectors 

For the purpose of expressing Chenopodium album cmo gene in tobacco, two types of 
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expression vectors were prepared. One was pBIcmo comprising the DNA shown in SEQ ID 
NO: 16 encoding the transit peptide (SEQ ID NO: 17), and the other was pBlcmoS not 
comprising the DNA encoding the transit peptide. 

First, a Smal site-added primer (primer 9) (SEQ ID NO: 15) was prepared in order to 
amplify a cmo gene sequence without the region encoding the transit peptide. 

Using this primer as 5' primer and primer 6 (SEQ ID NO: 12) as 3' primer, a PCR reaction 
was performed to amplify a gene fragment without the region encoding the transit peptide. 
The amplified fragment was Hgated to pT7Blue T-vector (Novagen), which was designated 
pT7cmoS. The PCR was performed 30 cycles, one cycle consisting of denaturation at 94 °C 
for 1 min, annealing at 60 °C for 1 min and extension at 72°C for 2 min. 

Composition of the PCR Reaction Solution: 



pT7cmo and pT7cmoS were separately digested with restriction enzymes SacI and Smal, 
and subjected to agarose gel electrophoresis. Then, an approx. 1 .2 kbp band and an approx. 1 A 
kbp band were cut out from the gel and purified. pBI121 (purchased from Clontech) was 
digested with restriction enzymes SacI and Smal, and then an approx. 1 1 kbp band was cut out 
and purified in the same manner. This band was ligated to each of the above-mentioned 
fragments to thereby prepare pBlcmo and pBlcmoS. 



H 2 0 

4 mM dNTPs mix 

Ex Taq (5 U/ it 1) (Takara) 

lOx Ex Taq buffer (Takara) 

pT7cmo (1 ng//£ 1) 

10 n M Primer (primer 9 + primer 6) 



78 til 
8 txl 
0.5 Ail 
10 ill 

1 jLil 



Total: 100 A 1 



EXAMPLE 4 



Gene Transfer into Tobacco 
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( 1 ) Transformation ofAgrobacterium tumefaciens 

Agrabacterium tumefaciens LBA4404 (purchased from Clontech) was cultured in L 
medium containing 250 Mg/ml streptomycin an 50 ju g/ml rifampicin at 28 °C, A cell 
suspension was prepared from the culture according to the method of Nagel et al. (Microbiol. 
Lett., 67:325, 1 990). Then, pBIcmo and pBIcmoS were separately introduced into the above- 
mentioned strain. 

(2) Transfer of the Polynucleotide Encoding CMO into Tobacco Cells 

Using the transformed Agrobacterium tumefaciens obtained in (1) above, transformation 
of Nicoiiana tabacum cv. SRI was performed according to the method of Horschet, et al. 
(Science, 277: 1229-1231, 1985). 

In each of the following Examples, those lines which were homozygous with respect to 
the transgene were selected from Rl generation of the cmo gene-transferred tobacco that had 
been obtained through redifferentiation, and used in experiments. In the cmo gene-transferred 
tobacco as obtained above, CMO protein is constantly expressed since the polynucleotide of the 
invention encoding CMO is under the control of CaMV35S promoter which is a high expression 
promoter. 

EXAMPLES 
Method of Tobacco Cultivation 

Tobacco seeds were sterilized by shaking them in 40 ml of 10% aqueous solution of 
sodium hypochlorite (Nacalai Tesque) supplemented with 10 fi 1 of Triton X-100 for 10 min and 
then washed with 500 ml of sterilized water in parts. The sterilized tobacco seeds were grown 
in 1/2 MS medium (sucrose concentration: 1,5%), which is a medium containing one half (1/2) 
of each of the components of Murashige and SJcoog (MS) medium (Murashige et al,, Physiol. 
Plant. 1 5:473-497, 1 962), at 25°C under conditions of 16 hr light/8hr dark for 2 weeks. 
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Thereafter, the resultant seedlings were transplanted to square-shaped pots containing 1/2 MS 
medium and subjected to various experiments. 

( 1 ) Western Blot Analysis 

The expression of CMO protein was examined at the protein level using the above- 
described cmo gene-transferred tobacco and a wild-type, non-recombinant tobacco SRI. 

Briefly, completely unfolded upper leaves (0.2 g) were taken from cmo gene-transferred 
tobacco (pBIcmo 4-2 line andpBIcmoS 53-1 line) and non-recombinant Nicotiana tabacum cv. 
SRI , The sample was disrupted in liquid nitrogen, suspended in 0.4 ml of an extraction buffer 
(1 % SDS, 0. 1 M NaHC0 3 , 5% 2-mercaptoethanol) and boiled for 5 min. Then, the sample 
was centrifuged at 15,000 rpm for 5 min at room temperature to obtain the supernatant as a 
protein extract. This protein extract was separated by SDS-PAGE and transferred onto a nylon 
membrane (Imobilon; Millipore). This membrane was incubated with the above-described 
antibody to CMO (5000-fold dilution of the anti-serum) and washed. The membrane was 
further incubated with a secondary antibody of 3000-fold dilution [affinity-purified goat anti- 
rabbit IgG (H+L) alkaline phosphatase conjugate; BioRad] and washed, followed by detection 
of CMO with a coloring solution (Konica Immunostain HRP-1000; Konica). 

The results of the Western blotting are shown in Fig. 1 . The existence of an immune 
responsive protein of 43 kDa corresponding to CMO was confirmed. In pBIcmo-transferred 
tobacco, it was shown that accumulation of CMO protein from which the transit peptide had 
been removed was increased by several times when 150 mM Nad stress was loaded for 1 day. 
In pBIcmoS-transferred tobacco, no induction of protein accumulation was observed when 150 
mM NaCl stress was loaded. From these results, it was indicated that a polypeptide sequence 
corresponding to the Chenopodium album CMO transit peptide promotes protein accumulation 
in response to salt stress, 

(2) Determination of Betaine Accumulation in Transformed Plants 

Betaine contents in plant leaves were calculated by measuring NMR spectra of quaternary 

3 0 



0 0- 3-24; 7:26PM;^H1fj*$l!l 



OBLON SPIVAK ; 3 5 D3 04 1 4 



# 43 



ammonium compounds (Wall, J. et al., Analyt Chem. 32:870-874, 1960). One gram each of 
leaves from the wild-type plant and the transformed plant was powdered in liquid nitrogen using 
a ceramic motor. The resultant powder was suspended in 4 ml of 1.0 M H 2 S0 4 , which was 
then shaken at 25 t! for24hr. After removal of insoluble matters, the suspension was 
centrifuged at lOOOxg for 10 min to recover a supernatant. To 1 ml of this supernatant, 0.4 ml 
of KI-I 2 solution was added and shaken at 4 °C for 80 min. The resultant solution was 
centrifuged at 13,000xg to thereby recover periodite-addition products of betaine, choline or the 
like. These products were dissolved in 0,6 ml of D20 (EURISO-TOP) containing t-butyl 
alcohol (Nacalai Tesque) as an internal standard, followed by measurement of 1H-NMR spectra. 

As a result, two major peaks of betaine and choline were observed. The integrated value 
for the betaine peak was used for the quantitative determination of betaine concentration. 

Tobacco plants grown for 2 weeks after sowing were transplanted to square-shaped pots 
containing 1/2 MS medium supplemented with 20 mM choline (Nacalai Tesque) and 100 mM 
NaCl. After 2-week cultivation, samples were collected and used for the quantitative 
determination of betaine. 

As a result, while only choline was observed in the wild-type plants, both betaine and 
choline were observed in the transformed plants. As shown in Fig. 2, the betaine content of 
pBIcmo4-2 plant was 2.0 p g/g fresh weight. These results indicated that transgenic plants 
expressing the CMO mature protein have ability to accumulate betaine. 

All publications, patents and patent applications cited herein are incorporated by reference 
in their entirety. 

According to the present invention, choline monooxygenase and the gene thereof are 
provided. The gene is applicable to the breeding of those plants which are highly tolerant to 
dry or saline soil. Also, the creation of such plants using the above gene will be helpful to 
restore plant cultivation in wasteland resulted from dry or saline soil, or to increase the yields of 
crops in areas of dry soil or saline soil. 
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SEQUENCE LISTING 

<110> NISHIMURA, Satoru 
KOIKE, Ayumi 

<120> CMO gene 

<130> PH-744US 

<140> 
<141> 

<150> JP 11-273725 
<151> 27-SEP-1999 

<160> 17 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 1828 
<212> DNA 

<213> Chenopodium album 

<220> 

<221> CDS 

<222> (129).. (1427) 

<400> 1 
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gaactataca agctaagtta agcttaagct atattgttga tcatctttca tactacttcc 60 

tttaaaaaaa aaattataac aacaaaagga agtgtgaatt ttttccttga tcatcatata 120 

acatcaat atg gca gca agt gca aca aca atg ttg ctg aaa tac cca aca 170 
Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr 

1 5 10 

act gta tgt ggt ata cca aat tea tea tea aac aat gat act tea aat 218 
Thr Val Cys Gly lie Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn 

15 20 25 30 

aat ata gtc cca att cca caa act agt act aat aat ccg gta ctt aag 266 

Asn lie Val Pro He Pro Gin Thr Ser Thr Asn Asn Pro Val Leu Lys 

35 40 45 

ttt cgt acc cct aat aaa acc att aac gec gtc get gec ccg get ttt 314 

Phe Arg Thr Pro Asn Lys Thr He Asn Ala Val Ala Ala Pro Ala Phe 

50 55 60 

cct tct tta aac acc acc act act ccg ccg teg att caa tea ctt gtc 362 

Pro Ser Leu Asn Thr Thr Thr Thr Pro Pro Ser lie Gin Ser Leu Val 

65 70 75 

cag gaa ttc gat ccg aag att ccg get aag gat get ctt acg cct cct 410 

Gin Glu Phe Asp Pro Lys He Pro Ala Lys Asp Ala Leu Thr Pro Pro 

80 85 90 
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age tct tgg tat act gac get get ttc tat get cat gaa ctt gac cgt 458 
Ser Ser Trp Tyr Thr Asp Ala Ala Phe Tyr Ala His Glu Leu Asp Arg 
95 100 105 110 

ate ttt tat aag gga tgg caa gtc cca ggg tac agt gat caa att aag 506 
He Phe Tyr Lys Gly Trp Gin Val Pro Gly Tyr Ser Asp Gin He Lys 
115 120 125 

gag cct aac caa tat ttc acc gga acg tta gga aat gtt gaa tat ttg 554 
Glu Pro Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr Leu 
130 135 140 

gtg tgt cga gat ggt gaa gga aaa gtt cat gca ttt cac aac gtt tgc 602 
Val Cys Arg Asp Gly Glu Gly Lys Val His Ala Phe His Asn Val Cys 
145 150 155 

acc cat cgt get teg att ctt get tgt gga agt gga aaa aaa teg tgt 650 
Thr His Arg Ala Ser 11$ Leu Ala Cys Gly Ser Gly Lys Lys Ser Cys 
160 165 170 

ttt gtg tgc cct tac cat gga tgg gta ttt ggc atg aat gga teg ctt 698 
Phe Val Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser Leu 
175 180 185 190 

aca aaa get tec aaa gca acc gaa gaa cag tea ctt gat ccc gat gaa 746 
Thr Lys Ala Ser Lys Ala Thr Glu Glu Gin Ser Leu Asp Pro Asp Glu 
195 200 205 
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ctt ggg ctt gta ccc ctg aaa gtt gca gta tgg ggc cca ttt ata etc 794 
Leu Gly Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe He Leu 
210 215 220 

ata agt ttg gac aga tea age ctt gaa gta ggt gat gtt gga tct gaa 842 
He Ser Leu Asp Arg Ser Ser Leu Glu Val Gly Asp Val Gly Ser Glu 
225 230 235 

tgg ctt ggt agt tgt get gaa gat gtt aag gec cat get ttt gac cct 890 
Trp Leu Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp Pro 
240 245 250 

aat tta cag ttc ate aat agg agt gaa ttt cca atg gaa tct aat tgg 938 
Asn Leu Gin Phe He Asn Arg Ser Glu Phe Pro Met Glu Ser Asn Trp 
255 260 265 270 

aag att ttc agt gac aac tat ttg gat age teg tac cat gtt cct tat 986 
Lys He Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro Tyr 
275 280 285 

gca cac aag tac tat get act gaa etc gac ttt gat act tac caa act 1034 
Ala His Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin Thr 
290 295 300 

gat atg ate gga aac gtc acg att caa aga gtg gca ggg agt tea aac 1082 
Asp Met He Gly Asn Val Thr He Gin Arg Val Ala Gly Ser Ser Asn 
305 310 315 
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aat ggt ttt aat aga ctt gga tct caa gca ttc tat get ttt gca tac 1130 

Asn Gly Phe Asn Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala Tyr 
320 325 330 

cct aac ttt get gtg gaa agg tat ggc cct tgg atg aca aca atg cac 1178 

Pro Asn Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met His 

335 340 345 350 

att ctt cca tta gga cca agg aaa tgc aaa tta gtg gtg gac tac tac 1226 

lie Leu Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr Tyr 
355 360 365 

att gaa aaa tea aag ctg gac gac aag gat tac ate gag aag ggc att 1274 

He Glu Lys Ser Lys Leu Asp Asp Lys Asp Tyr He Glu Lys Gly He 
370 375 380 

gca ate aat gat aat gta cag aaa gaa gat gtg gtg ttg tgt gaa agt 1322 

Ala He Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cya Glu Ser 
3S5 390 395 

gtc caa aaa ggg ttg gaa aca cca gca tat cgt agt gga aga tat gtg 1370 

Val Gin Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr Val 
400 405 410 

atg cca att gag aaa gga ate cat cat ttc cac tgc tgg ttg cac caa 1418 

Met Pro He Glu Lys Gly He His His Phe His Cys Trp Leu His Gin 

415 420 425 430 
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gta ttg aag tgattgcagc agatcatcag atgttcgttt cttcttgtat 1467 
Val Leu Lys 

tggaattgga tattatgatt aataagtaaa attataatgt cataatgtag ttgagattgt 1527 
tgctagagtt gagcgtatgc tcctcatgca ctacttagtt atcaagtgtg tatgtctttg 1587 
gtcatgggca aaatgtatgt ttcttgctag aatttatata ttatggtgct aatgtccaat 1647 
ataaataaaa accatagcac ccctttaatt ccctacttag gtttatatcc catttatttt 1707 
cgggggatct atgagataga ttgtctatga acattatttt tcgactcgtg tatggtattc 1767 
atcccttgtg tagggtgaag taaacattga gtgtatgaag ttttcattga gtttctgctt 1827 
t 1828 

. <210> 2 
<211> 433 
<212> PRT 

<213> Chenopodium album 
<400> 2 

Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 

Cys Gly He Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn Asn He 

37 



3-24; 7:26PM;¥«t!Mr 



OBLON SPIVAK 



13503041 4 



20 25 30 

Val Pro lie Pro Gin Thr Ser Thr Asn Asn Pro Val Leu Lys Phe Arg 
35 40 45 

Thr Pro Asn Lys Thr lie Asn Ala Val Ala Ala Pro Ala Phe Pro Ser 
50 55 60 

Leu Asn Thr Thr Thr Thr Pro Pro Ser lie Gin Ser Leu Val Gin Glu 
65 70 75 80 

Phe Asp Pro Lys lie Pro Ala Lys Asp Ala Leu Thr Pro Pro Ser Ser 
85 90 95 

Trp Tyr Thr Asp Ala Ala Phe Tyr Ala His Glu Leu Asp Arg He Phe 
100 105 HO 

Tyr Lys Gly Trp Gin Val Pro Gly Tyr Ser Asp Gin He Lys Glu Pro 
115 120 125 

Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr Leu Val Cys 
130 135 140 

Arg Asp Gly Glu Gly Lys Val His Ala Phe His Asn Val Cys Thr His 
145 150 155 160 

Arg Ala Ser He Leu Ala Cys Gly Ser Gly Lys Lys Ser Cys Phe Val 
165 170 175 
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Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser Leu Thr Lys 
ISO 185 190 

Ala Ser Lys Ala Thr Glu Glu Gin Ser Leu Asp Pro Asp Glu Leu Gly 
195 200 205 

Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe He Leu He Ser 
210 215 220 

Leu Asp Arg Ser Ser Leu Glu Val Gly Asp Val Gly Ser Glu Trp Leu 
225 230 235 240 

Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp Pro Asn Leu 
245 250 255 

Gin Phe He Asn Arg Ser Glu Phe Pro Met Glu Ser Asn Trp Lys He 
260 265 270 

Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro Tyr Ala His 
275 280 285 

Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin Thr Asp Met 
290 295 300 

He Gly Asn Val Thr He Gin Arg Val Ala Gly Ser Ser Asn Asn Gly 
305 310 315 320 
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Phe Asn Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala Tyr Pro Asn 
325 330 335 

Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met His He Leu 
340 345 350 

Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr Tyr lie Glu 
355 360 365 

Lys Ser Lys Leu Asp Asp Lys Asp Tyr lie Glu Lys Gly lie Ala He 
370 375 380 

Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cys Glu Ser Val Gin 
3S5 390 395 400 

Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr Val Met Pro 
1 405 410 415 

He Glu Lys Gly He His His Phe His Cys Trp Leu His Gin Val Leu 
420 425 430 

Lys 



<210> 3 
<211> 1651 
<212> DNA 
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<213> Chenopodium album 

<220> 
<221> CDS 

<222> (119).. (1423) 
<400> 3 

cttgaattat acaagctaag tatatatatt gttgatcatc tttcatacca cctttaaaaa 60 

aaattataac aacaaaagga agtgtttagt tattgcttga tcatcatata atatcaac US 

atg tea gca agt gca aca aca atg ttg ctg aaa tac cca aca act gta 166 
Met Ser Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 

tgt ggt at a cca aat tea tea tea aac aat gat act tea aat aac ate 214 
Cys Gly lie Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn Asn He 
20 25 30 

gtc cca att cca caa act agt act aat aat ccg gta ctt aag ttt cgt 262 
Val Pro He Pro Gin Thr Ser Thr Asn Asn Pro Val Leu Lys Phe Arg 
35 40 45 

acc cct aat aaa acc att aac gee gtc get gec ccg get ttt cct tct 310 
Thr Pro Asn Lys Thr He Asn Ala Val Ala Ala Pro Ala Phe Pro Ser 
50 55 60 

tta agt acc acc act act ccg ccg teg att caa tea ctt gtc cag gaa 358 
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Leu Ser Thr Thr Thr Thr Pro Pro Ser lie Gin Ser Leu Val Gin Glu 

65 70 75 80 

ttc gat ccg agg att ctg gcc gag gat get etc acg cct cct age tct 406 

Phe Asp Pro Arg He Leu Ala Glu Asp Ala Leu Thr Pro Pro Ser Ser 
85 90 95 

tgg tat act gaa cct gcc ttc tat get cat gaa Ctt gac cgt ate ttt 454 

Trp Tyr Thr Glu Pro Ala Phe Tyr Ala His Glu Leu Asp Arg He Phe 

100 105 110 

tac aaa gga tgg caa gtc gca ggg tac age gat caa att aag gag cct 502 

Tyr Lys Gly Trp Gin Val Ala Gly Tyr Ser Asp Gin He Lys Glu Pro 

115 120 125 

aac caa tat ttc acc gga acg tta gga aat gtt gaa tat ttg gtg tgt 550 

Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr Leu Val Cys 

130 135 140 

cga gat ggt gaa gga aaa gtt cat gca ttt cac aat gtt tgc act cat 598 

Arg Asp Gly Glu Gly Lys Val His Ala Phe His Asn Val Cys Thr His 

145 150 . 155 160 

cgt get teg att ctt get tgt gga agt ggc aaa aaa teg tgt ttc gta 646 

Arg Ala Ser He Leu Ala Cys Gly Ser Gly Lys Lys Ser Cys Phe Val 
165 170 175 

tgc cct tac cat ggt tgg gta ttt ggc atg aat gga tea ctt acg aaa 694 
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Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser Leu Thr Lys 
ISO 185 190 

get tec aaa gca acc gaa gaa cag tec ctt gat ccc gat gaa ctt ggg 742 

Ala Ser Lys Ala Thr Glu Glu Gin Ser Leu Asp Pro Asp Glu Leu Gly 
195 200 205 

ctt gta ccc ctg aaa gtt gca gta tgg ggc cca ttt ata etc ate agt 790 

Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe He Leu He Ser 

210 215 220 

ttg gac aga tea age ctt gaa gta ggc gat gtt gga tct gaa tgg ctt 838 

Leu Asp Arg Ser Ser Leu Glu Val Gly Asp Val Gly Ser Glu Trp Leu 

225 230 235 240 

ggt agt tgt get gaa gat gtt aag gec cat get ttt gac cct aat ttg 886 

Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp Pro Asn Leu 

245 250 255 

cag ttc ate aat agg agt gaa ttt cca atg gaa tct aat tgg aag att 934 

Gin Phe He Asn Arg Ser Glu Phe Pro Met Glu Ser Asn Trp Lys lie 
260 265 270 

ttc agt gac aac tac ttg gat age teg tac cat gtt cct tat gca cac 982 

Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro Tyr Ala His 
275 280 285 

aag tac tat gca act gaa etc gac ttt gat act tat caa acc gat atg 1030 
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Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin Thr Asp Met 
290 295 300 

att gga aat gtc acg att caa aga gtg gcg ggg agt tea aac aag cca 1078 
He Gly Asn Val Thr He Gin Arg Val Ala Gly Ser Ser Asn Lys Pro 
305 310 315 320 

gat ggt ttt gat aga ctt gga tct caa gca ttc tat get ttt gca tac 1126 
Asp Gly Phe Asp Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala Tyr 
325 330 335 

cct aac ttt get gtg gaa agg tat ggc cct tgg atg aca aca atg cat 1174 
Pro Asn Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met His 
340 345 350 

att ctt cca tta gga cca aga aaa tgc aaa tta gtg gtg gac tac tat 1222 
He Leu Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr Tyr 
355 360 365 

att gaa aaa tea atg ctg gac gac aag gat tac ate gag aag ggc ata 1270 
He Glu Lys Ser Met Leu Asp Asp Lys Asp Tyr He Glu Lys Gly He 
370 375 380 

gca ate aat gat aat gta cag aaa gaa gat gtg gtg ttg tgt gaa agt 1318 
Ala He Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cys Glu Ser 
385 390 395 400 

gtc caa aaa ggg ttg gag aca cca gca tat cgt agt gga aga tat gtg 1366 
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Val Gin Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr Val 
405 410 415 

atg cca att gag aaa gga ate cat cat ttc cac tgt tgg ttg cac caa 1414 
Met Pro lie Glu Lys Gly He His His Phe His Cys Trp Leu His Gin 
420 425 430 

gta ttg aag tgatagcagc agatcagatg ttcgtttctt aatttccttt 1463 
Val Leu Lys 
435 

tattggaact ggataattat aataataata agtaaaaaag taaaattata atgtcatgta 1523 
gttgagattg ttgctagagt tgagcgtatg ctcctcatgc acttagttat caagtgtgta 1583 
tgtgtttggt catggacaaa atgtttcttg ctagaattta tcatattata aggtgctaat 1643 
gtccaata 1651 



<210> 4 
<211> 435 
<212> PRT 

<213> Chenopodiura album 
<400> 4 

Met Ser Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 
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Cys Gly lie Pro Asn Ser Ser Ser Asn A$n Asp Thr Ser Asn Asn He 
20 25 30 

Val Pro He Pro Gin Thr Ser Thr Asn Asn Pro Val Leu Lys Phe Arg 
35 40 45 

Thr Pro Asn Lys Thr He Asn Ala Val Ala Ala Pro Ala Phe Pro Ser 
50 55 60 

Leu Ser Thr Thr Thr Thr Pro Pro Ser He Gin Ser Leu Val Gin Glu 
65 70 75 80 

Phe Asp Pro Arg He Leu Ala Glu Asp Ala Leu Thr Pro Pro Ser Ser 
85 90 95 

Trp Tyr Thr Glu Pro Ala Phe Tyr Ala His Glu Leu Asp Arg He Phe 
100 105 110 

Tyr Lys Gly Trp Gin Val Ala Gly Tyr Ser Asp Gin He Lys Glu Pro 
115 120 125 

Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr Leu Val Cys 
130 135 140 

Arg Asp Gly Glu Gly Lys Val His Ala Phe His Asn Val Cys Thr His 
145 150 155 160 
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Arg Ala Ser lie Leu Ala Cys Gly Ser Gly Lys Lys Ser Cys Phe Val 
165 170 175 

Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser Leu Thr Lys 
180 185 190 

Ala Ser Lys Ala Thr Glu Glu Gin Ser Leu Asp Pro Asp Glu Leu Gly 
195 200 205 

Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe He Leu He Ser 
210 215 220 

Leu Asp Arg Ser Ser Leu Glu Val Gly Asp Val Gly Ser Glu Trp Leu 
225 230 235 240 

Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp Pro Asn Leu 
245 250 255 

Gin Phe He Asn Arg Ser Glu Phe Pro Met Glu Ser Asn Trp Lys He 
260 265 270 

Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro Tyr Ala His 
275 280 285 

Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin Thr Asp Met 
290 295 300 

He Gly Asn Val Thr He Gin Arg Val Ala Gly Ser Ser Asn Lys Pro 
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305 310 315 320 

Asp Gly Phe Asp Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala Tyr 
325 330 335 

Pro Asn Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met His 
340 345 350 

He Leu Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr Tyr 
355 360 365 

He Glu Lys Ser Met Leu Asp Asp Lys Asp Tyr He Glu Lys Gly He 
370 375 380 

Ala He Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cys Glu Ser 
385 390 395 400 

Val Gin Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr Val 
405 410 415 

Met Pro He Glu Lys Gly He His His Phe His Cys Trp Leu His Gin 
420 425 430 

Val Leu Lys 
435 



<210> 5 
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<211> 1712 
<212> DNA 

<213> Chenopodium album 

<220> 
<221> CDS 

<222> (133). . (1431) 
<400> 5 

Cttgaattac acaagctaag ttaagctaag ctatattgtt gatcatcttt cataccacct 60 

cctttaaaaa aaaaaaatta taacaacaaa aggaagtgtt tagttattgc ttgatcatca 120 

tataacatca at atg gca gca agt gca aca aca atg ttg ctg aaa tac cca 171 
Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro 
1 5 10 

aca act gta tgt ggt ata cca aat tea tea tea aac aat gat act tea 219 
Thr Thr Val Cys Gly He Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser 
15 20 25 

aat aac ate gtc cca att cca caa act att act aat aat ccg gta ctt 267 
Asn Asn He Val Pro He Pro Gin Thr He Thr Asn Asn Pro Val Leu 
30 35 40 45 

aag ttt cgt acc cct aat aaa acc att aac gec gtc get gec ccg get 315 
Lys Phe Arg Thr Pro Asn Lys Thr He Asn Ala Val Ala Ala Pro Ala 
50 55 60 
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ttt cct tct tta aac acc acc act act ccg ccg tea att caa tea ctt 363 
Phe Pro Ser Leu Asn Thr Thr Thr Thr Pro Pro Ser He Gin Ser Leu 
65 70 75 

gtc cag gaa ttc gat ccg agg att ccg gec gag gat get ctt acg cct 411 
Val Gin Glu Phe Asp Pro Arg He Pro Ala Glu Asp Ala Leu Thr Pro 
80 85 90 

cct age tct tgg tat act gaa cct get ttc tat get cat gaa ctt gac 459 
Pro Ser Ser Trp Tyr Thr Glu Pro Ala Phe Tyr Ala His Glu Leu Asp 
95 100 105 

cgt ate ttt tac aag gga tgg caa gtc gca ggg tac agt gat caa att 507 
Arg He Phe Tyr Lys Gly Trp Gin Val Ala Gly Tyr Ser Asp Gin He 
110 115 120 125 

aag gag cct aac caa tat ttc acc gga acg tta gga aat gtt gaa tat 555 
Lys Glu Pro Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr 
130 135 140 

ttg gtg tgt cga gat ggt gaa ggt aaa gtt cat gca ttt cac aac gtt 603 
Leu Val Cys Arg Asp Gly Glu Gly Lys Val His Ala Phe His Asn Val 
145 150 155 

tgc acc cat cgt get teg att ctt get tgt gga age gga aaa aaa teg 651 
Cys Thr His Arg Ala Ser He Leu Ala Cys Gly Ser Gly Lys Lys Ser 
160 165 170 
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tgt ttt gta tgc cct tac cat gga tgg gta ttt ggc atg aat gga teg 699 
Cys Phe Val Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser 
175 180 185 

ctt aca aaa get tec aaa gca age gaa gaa cag tea ctt gat ccc gat 747 
Leu Thr Lys Ala Ser Lys Ala' Ser Glu Glu Gin Ser Leu Asp Pro Asp 
190 195 200 205 

gaa ctt ggg ctt gta ccc ctg aaa gtt gca gta tgg ggc cca ttt ata 795 
Glu Leu Gly Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe lie 
210 215 220 

etc ate agt ttg gac aga tea age ctt gaa gta gat gat gtt gga tct 843 
Leu lie Ser Leu Asp Arg Ser Ser Leu Glu Val Asp Asp Val Gly Ser 
225 230 235 

gaa tgg ctt ggt agt tgt get gaa gat gtt aag gee cat get ttt gac 891 
Glu Trp Leu Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp 
240 245 250 

cct aat ttg cag ttc ate aat agg agt gaa ttt cca atg gaa tct aat 939 
Pro Asn Leu Gin Phe He Asn Arg Ser Glu Phe Pro Met Glu Ser Asn 
255 260 265 

tgg aag att ttc agt gac aac tat ttg gat age teg tac cat gtt cct 987 
Trp Lys He Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro 
270 275 280 285 
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tat gca cac aag tac tat get act gaa etc gac ttt gat act tac caa 1035 
Tyr Ala His Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin 
290 295 300 

act gat atg ate gga aat gtc acg att caa aga gtg gca ggg agt tea 1083 
Thr Asp Met lie Gly Asn Val Thr He Gin Arg Yal Ala Gly Ser Ser 
305 310 315 

aac aat ggt ttt aat aga ctt gga tct caa gca ttc tac get ttt gca 1131 
Asn Asn Gly Phe Asn Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala 
320 325 330 

tac cct aac ttt get gtg gaa agg tat ggc cct tgg atg aca aca atg 1179 
Tyr Pro Asn Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met 
335 340 345 

cac att ctt cca tta gga cca agg aaa tgc aaa tta gtg gtg gac tac 1227 
His He Leu Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr 
350 355 360 365 

tat att gaa aaa tea aag ctg gac gac aag gat tac ate gag aag ggc 1275 
Tyr He Glu Lys Ser Lys Leu Asp Asp Lys Asp Tyr He Glu Lys Gly 
370 375 380 

ata gca ate aat gat aat gta cag aaa gaa gat gtg gtg ttg tgt gaa 1323 
He Ala He Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cys Glu 
385 390 395 
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agt gtc caa aaa ggg ttg gag aca cct gcg tat cgt agt gga aga tat 1371 
Ser Val Gin Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr 
400 405 410 

gtg atg cca att gag aaa gga ate cat cat ttc cac tgt tgg ttg cac 1419 
Val Met Pro He Glu Lys Gly He His His Phe His Cys Trp Leu His 
415 420 425 

caa gta ttg aag tgattgeage agatcagatg ttcgtttctt aatttccttt 1471 

Gin Val Leu Lys 

430 

tattggaatt ggatgattgt tataataata agtaaaatta taatgtcatg tagttgagat 1531 
tgttgctaga gttgagcgta tgctcctcat gcacttagtt atcaagtgtg tatgtgtttg 1591 
gtcatgggca aaatgtattt tettgetaga atttgttata ttatggtgct aatgtccaat 1651 
aatataaata acaccattgc accctttccc tacttgagaa attatatccc atttattttc 1711 
a 1712 



<210> 6 
<211> 433 
<212> PRT 

<213> Chenopodium album 
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<400> 6 

Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 

Cys Gly lie Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn Asn He 
20 25 30 

Val Pro He Pro Gin Thr He Thr Asn Asn Pro Val Leu Lys Phe Arg 
35 40 45 

Thr Pro Asn Lys Thr He Asn Ala Val Ala Ala Pro Ala Phe Pro Ser 
50 55 60 

Leu Asn Thr Thr Thr Thr Pro Pro Ser He Gin Ser Leu Val Gin Glu 
65 70. 75 80 

Phe Asp Pro Arg He Pro Ala Glu Asp Ala Leu Thr Pro Pro Ser Ser 
85 90 95 

Trp Tyr Thr Glu Pro Ala Phe Tyr Ala His Glu Leu Asp Arg He Phe 
100 105 HO 

Tyr Lys Gly Trp Gin Val Ala Gly Tyr Ser Asp Gin He Lys Glu Pro 
115 120 125 

Asn Gin Tyr Phe Thr Gly Thr Leu Gly Asn Val Glu Tyr Leu Val Cys 
130 135 140 
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Arg Asp Gly Glu Gly Lys Yal His Ala Phe His Asn Val Cys Thr His 
145 150 155 160 

Arg Ala Ser He Leu Ala Cys Gly Ser Gly Lys Lys Ser Cys Phe Val 
165 170 175 

Cys Pro Tyr His Gly Trp Val Phe Gly Met Asn Gly Ser Leu Thr Lys 
180 185 190 

Ala Ser Lys Ala Ser Glu Glu Gin Ser Leu Asp Pro Asp Glu Leu Gly 
195 200 205 

Leu Val Pro Leu Lys Val Ala Val Trp Gly Pro Phe He Leu He Ser 
210 215 220 

Leu Asp Arg Ser Ser Leu Glu Val Asp Asp Val Gly Ser Glu Trp Leu 
225 230 235 240 

Gly Ser Cys Ala Glu Asp Val Lys Ala His Ala Phe Asp Pro Asn Leu 
245 250 255 

Gin Phe lie Asn Arg Ser Glu Phe Pro Met Glu Ser Asn Trp Lys He 
260 265 270 

Phe Ser Asp Asn Tyr Leu Asp Ser Ser Tyr His Val Pro Tyr Ala His 
275 280 285 
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Lys Tyr Tyr Ala Thr Glu Leu Asp Phe Asp Thr Tyr Gin Thr Asp Met 
290 295 300 

lie Gly Asn Val Thr He Gin Arg Val Ala Gly Ser Ser Asn Asn Gly 
305 310 315 320 

Phe Asn Arg Leu Gly Ser Gin Ala Phe Tyr Ala Phe Ala Tyr Pro Asn 
325 330 335 

Phe Ala Val Glu Arg Tyr Gly Pro Trp Met Thr Thr Met His He Leu 
340 345 350 

Pro Leu Gly Pro Arg Lys Cys Lys Leu Val Val Asp Tyr Tyr He Glu 
355 360 365 

Lys Ser Lys Leu Asp Asp Lys Asp Tyr He Glu Lys Gly He Ala lie 
370 375 380 

Asn Asp Asn Val Gin Lys Glu Asp Val Val Leu Cys Glu Ser Val Gin 
385 390 395 400 

Lys Gly Leu Glu Thr Pro Ala Tyr Arg Ser Gly Arg Tyr Val Met Pro 
405 410 415 

He Glu Lys Gly He His His Phe His Cys Trp Leu His Gin Val Leu 
420 425 430 

Lys 
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<210> 7 
<2U> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<221> modified_base 
<222> 9 

<223> n represents a, g, c or 
<220> 

<221> modified_base 
<222> 15 

<223> n represents a, g, c or 
<220> 

<221> modified_base 
<222> IS 

<223> n represents a, g, c or 
<400> 7 

tggtayacng arccngcntt yta 
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<210> 8 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<221> modified_base 
<222> 9 

<223> n represents a, g, c or t 
<220> 

<221> modified_base 
<222> 15 

<223> n represents a, g, c or t 
<220> 

<221> modified_base 
<222> IS 

<223> n represents a, g 3 c or t 
<400> 8 

tayttrtgng crtanggnac rtgrta 
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<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 9 

gtgcattgtt gtcatccaag ggcc 



<210> 10 
<2X1> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 10 

gatcccgatg aacttgggct tgtacccc 28 



<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic DNA 
<400> 11 

cccgggttta gttattgctt gatcat 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 12 

gagctcctgc aatcacttca atactt 

<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
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<400> 13 

taatggatcc attaacgccg tcgc 

<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 14 

gggtaecaat cacttcaata cttgg 

<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 15 

cccgggaaaa ccattatggc cgtcgc 



61 



00- 3-24; 



OBLON SPIVAK 



,'3 5 03 04 1 4 



# 74 



<210> 16 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<220> 

<221> CDS 

<222> (1).. (126) 

<400> 16 

atg gca gca agt gca aca aca atg ttg ctg aaa tac cca aca act gta 48 
Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 

tgt ggt ata cca aat tea tea tea aac aat gat act tea aat aac ate 96 
Cys Gly lie Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn Asn lie 



20 



25 



30 



gtc cca att cca caa act att act aat aat c 



127 



Val Pro He Pro Gin Thr He Thr Asn Asn 



35 



40 



<210> 17 



<211> 42 
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<212> PRT 

<213> Artificial Sequence 
<400> 17 

Met Ala Ala Ser Ala Thr Thr Met Leu Leu Lys Tyr Pro Thr Thr Val 
15 10 15 

Cys Gly lie Pro Asn Ser Ser Ser Asn Asn Asp Thr Ser Asn Asn lie 
20 25 30 

Val Pro He Pro Gin Thr He Thr Asn Asn 
35 40 

SEQUENCE LISTING FREE TEXT 

SEQ ID NO:7: n represents a,g,c or t (location; 9). 

SEQ ID NO:7: n represents a,g,c or t (location: 15). 

SEQ ID NO:7: n represents a ? g,c or t (location: 18). 

SEQ ID NO:8: n represents ajgjC or t (location: 9). 

SEQ ID NO:8: n represents a,g,c or t (location: 15). 

SEQ ID NO:8: n represents a,g,c or t (location: 1 8). 
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What is claimed is: 

1 . The following recombinant protein (a) or (b): 

(a) a protein comprising the amino acid sequence shown in SEQ ID NO: 2, 4 or 6; 

(b) a protein which comprises the ammo acid sequence shown in SEQ ID NO: 2, 4 or 6 
having deletion, substitution or addition of one or several amino acids, and which has choline 
monooxygenase activity. 

2. A choline monooxygenase gene encoding the following protein (a) or (b): 

(a) a protein comprising the amino acid sequence shown in SEQ ID NO: 2, 4 or 6; 

(b) a protein which comprises the amino acid sequence shown in SEQ ID NO: 2, 4 or 6 
having deletion, substitution or addition of one or several amino acids and which has choline 
monooxygenase activity. 

3 . A gene comprising the following DNA (c) or (d): 

(c) a DNA comprising the nucleotide sequence shown in SEQ ID NO: 1, 3 or 5; 

(d) a DNA which hybridizes to a DNA comprising the nucleotide sequence shown in 
SEQ ID NO: 1, 3 or 5 under stringent conditions and which encodes a protein having choline 
monooxygenase activity. 

4. A recombinant vector comprising the gene according to claim 2 or 3 . 

5. A transformant comprising the recombinant vector according to claim 4. 

6. A method for producing a choline monooxygenase, comprising culturing the transformant 
according to claim 5 and recovering the choline monooxygenase from the resultant culture. 

7. The following peptide (e) or (f): 
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(e) a peptide comprising the amino acid sequence shown in SEQ 3D NO: 17; 

(f) a peptide which comprises the amino acid sequence shown in SEQ ID NO: 17 having 
deletion, substitution or addition of one or several amino acids and which has signal peptide 
activity; or a salt thereof. 

8. A gene encoding the following peptide (e) or (f): 

(e) a peptide comprising the amino acid sequence shown in SEQ ID NO: 17; 

(f) a peptide which comprises the amino acid sequence shown in SEQ ID NO: 17 having 
deletion, substitution or addition of one or several amino acids and which has signal peptide 
activity. 

9. A gene comprising the following DNA (g) or (h); 

(g) a DNA comprising the nucleotide sequence shown in SEQ ID NO: 16; 

(h) a DNA which hybridizes to a DNA comprising the nucleotide sequence shown in 
SEQ ID NO: 16 under stringent conditions and which encodes a protein having signal peptide 
activity. 

10. A recombinant vector comprising the gene according to claim 8 or 9 and a gene of interest 

11. The recombinant vector according to claim 1 0 3 wherein the gene of interest leads to 
production of a polypeptide or production of a plant metabolite. 

12. The recombinant vector according to claim 1 0, wherein the polypeptide or the plant 
metabolite confers stress resistance. 

13. The recombinant vector according to claim 10, wherein the gene of interest is 
Chenopodium album choline monooxygenase gene. 
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14. A transfonnant comprising the recombinant vector according to any one of claims 10 to 
13. 

1 5 . The transformant according to claim 1 4, which is a plant body, plant organ, plant tissue or 
cultured plant cell. 

1 6. An environmental stress-resistant plant which is obtained by culturing or cultivating a 
transformed plant comprising the recombinant vector according to claim 12 or 13 under 
environmental stress conditions. 

17. The plant according to claim I6 } wherein the environmental stress is salt stress. 

18. A method for creating an environmental stress-resistant plant, comprising culturing or 
cultivating a transformed plant comprising the recombinant vector according to claim 12 or 13 
under environmental stress conditions. 

19. A method for inducing accumulation of a polypeptide or a plant metabolite, comprising 
culturing or cultivating the transformant according to claim 14 or 15 under environmental stress 
conditions. 

20. The method according to claim 19, wherein the plant metabolite is a substance which 
confers environmental stress resistance, 

2 1 . The method according to claim 20, wherein the substance which confers environmental 
stress resistance is betaine. 

22. A method for producing betaine, comprising culturing or cultivating a transformant 
comprising the recombinant vector according to claim 13 and recovering betaine from the 
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resultant culture or cultivated product. 
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ABSTRACT 

According to the present invention, plant choline monooxygenase and the gene thereof are 
provided. The present invention discloses the following recombinant proteins (a) and (b) as 
well as genes encoding the proteins: 

(a) a protein comprising the amino acid sequence shown in SEQ ID NO: 2, 4 or 6; 

(b) a protein which comprises the amino acid sequence shown in SEQ ID NO: 2, 4 or 6 
having deletion, substitution or addition of one or several amino acids, and which has choline 
monooxygenase activity. 
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43^>- - - rtff^' ? No Treatment 

43kDa>- %alt Treatment 

FIG. 1 Salt Stress Responsivity of CMO Transit Peptide 
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Attorney's Docket No.:_ 

DECLARATION, POWER OF ATTORNEY AND PETITION 

I (We), the undersigned inventor(s), hereby declare that: 

My residence, post office address and citizenship are as stated below next to my 
name, 

I (We) believe that I am (we are) the original, first, and joint (sole) 
in vcntor{s) of the subject matter which is claimed and for wh tch a patent is 
sought on the invention entitled 

CHOLINE MQNOOXYGETSTASE GENE 



the specification of which 

■ is attached hereto. 
□ was filed on _ 



Application Serial .NTn 
and amended on 



□ was filed as PCT international application 
Number 

o n 



and was amended under PCT Article 19 

on_ _ fif applicable). 



I (We) hereby state that I (We) have reviewed and understand the contents of 
the above-iden tified specification, including the claims, as amended by any 
amendmen t referred to above; th at I (We) do n ot know and do n ot believe that 
this invention was ever known Or used before my invention or discovery 
thereof. Or patented or described in any printed publication in any country 
before my invention or discovery thereof, or more than one year prior to this 
application , or in public use or on sate in th e Un i Led States for more than one 
year prior to this application; that this invention or discovery has not been 
patented or made the subject of an inventor's certificate in any country 
foreign to the United States on an application filed by me ot my legal 
representatives or assigns more than twelve months before this application. 
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I (We) acknowledge the duty to disclose information known to be material to 
the patentability of this application as defined in Section 1.56 of Title 37 Code of 
Federal Regulations. 

I (We) hereby claim foreign priority benefits under Section 119(a)-(d) of Title 
35 United States Code, of any foreign application (s) for patent or inventor's 
certificate listed below and have also identified below any foreign application 
for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed: 

Priority 

Application No. Country Filing date claimed 

273275/1999 Japan September 27. 1999 ■ Yes □ No 

„ □ Yes D No 

□ Yes □ No 

__. . □ Yes □ No 

I hereby claim the benefit under Section 119(e) of Title 35 United States Code, 
of any United States application(s) listed below. 



(Application Number) (Filing Date) 



(Application Number) (Filing Date) 

I (We) hereby claim the benefit under Section 120 of Title 35 United States Code, 
of any United States application (s) listed below and, insofar as the subject 
matter of each of the claims of this application is not disclosed in the prior 
United States application in the manner provided by the first paragraph of 
Section 112 of Title 35 United States Code, I (We) acknowledge the duty to 
disclose material information as defined in Section 1.56(a) of Title 37 Code of 
Federal Regulations, which occurred between the filing date of the prior 
application and national or PCT international filing date of this application: 
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Application Serial No. 



Filing Date 



Status (pending, 
patented, 
abandoned) 



And I (We) hereby appoint: Norman F. Obion, Registration No. 24,6 1 8; Marvin J. Spivak, 
Registration No. 24,913; C. Irvin McClelland, Registration No. 21,124; Gregory J, Maier, Registration No. 
25,599; Arthur 1. Ncustadt, Registration No. 24.854; Robert C Miller, Registration No. 25,357; Richard 
D. Kelly, Registration No. 27,757; James D. Hamilton, Registration No. 28,421; Eckhard H. Kuesters, 
Registration No. 28,870; Robert T. Pous. Registration No. 29,099; Charles L. Ghol2, Registration No. 
26,395; Vincent J. Sunderdick, Registration No, 29,004; William E.Beaumont, Registration No- 30.996; 
Steven B. Kelbcr, Registration No. 30,073: Stuart D. Dwork, Registration No. 31,103; Robert F. Gnuse, 
Registration No. 27,295; Jean-Paul La valley e, Registration No.31,451; William B.Walker, Registration 
No. 22.493; Timothy R. Schwartz, Registration No. 32,171 ; Stephen G, Baxter, Registration No. 32.884; 
Gilberto M. Villacorta, Registration No. 34,038; and John H.O.Clarke, Registration No, 17,373, 
I(We) hereby request that all correspondence regarding this application be sent to the firm of OBLON, 
SFIVAK, McCLELLAND, MAIER Sc. NEUSTADT. P.C. whose Post office address is: Fourth Floor, 1755 
Jefferson Davis Highway, Arlington, Virginia 22202 U.S.A. 

1 (We) declare further th at all statements made herein of my (our) knowledge 
are true and that all statements were made with the knowledge that willful 
false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section lOOl of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 

Satora Nishirrmra Residence: Tovota-shi. Atchi. Japan 

NAME OF FIRST SOLE INVENTOR 




Citizen of: Japan 
Post Office Address: 



Signature of Inventor 



March 22 , 2000 



Date 



c/o TOYOTA JIDQSHA KABUSHIKI 
KAISHA. 1. Tovota-cho. Tovota-shi 
Aichi 471-8571 Japan 



DO- 3-24; 7:26PM;f«^$iffi 



OBLON SPIVAK 



f 3 5 03 0 4 1 4 



# 12 



Avumi Koike 

NAME OF SECOND JOINT INVENTOR 

Signature of Inventor 

March 22, 2000 

Date 



Residence: Yokohama-shi. 
Kanagawa. Japan ^ 



Citizen of: Japan __ 

Post Office Address: . 

c/o TOYOTA JTDOSHA KABUSHIKT 
KAJSHA. 1. Tovota-cho, Tovota-shi. 
Aichi 471-8571 Japan 



— . , _ . Residence: 

NAME OF THIRD JOINT INVENTOR 



— . . _ Citizen of; _____ 

S i gn at lire of Inventor Post Office Address : 



Date 



_ ._ Residence: 

NAME OF FOURTH JOINT INVENTOR 



_ . . Citizen of: 

Signature of Inventor Post Office Address: 



Date 



